Beginning Portuguese corpus linguistics: exploring a corpus to teach Portuguese as a foreign language
نویسندگان
چکیده
منابع مشابه
A brazilian portuguese language corpus development
This article presents the techniques that are being used for the creation of a database related to the Brazilian Portuguese language. This database is composed of a collection of recorded voices, from different speakers and different regions of Brazil. The collected voices contain varied phonetic and phonologic information. The applications of this database are diverse, including synthesis and ...
متن کاملCorpus linguistics and second / foreign language learning : exploring multiple paths
The aim of this article is twofold: first, to briefly assess the influence that corpus linguistic research has had on second/foreign language learning so far, and second, to suggest future directions for a more coherent and well thought out integration of corpora in instructed settings. In section 1, the influence of native and learner corpus research on second/foreign language learning will be...
متن کاملThe COPLE2 corpus: a learner corpus for Portuguese
We present the COPLE2 corpus, a learner corpus of Portuguese that includes written and spoken texts produced by learners of Portuguese as a second or foreign language. The corpus includes at the moment a total of 182,474 tokens and 978 texts, classified according to the CEFR scales. The original handwritten productions are transcribed in TEI compliant XML format and keep record of all the origi...
متن کاملTimeBankPT: A TimeML Annotated Corpus of Portuguese
In this paper, we introduce TimeBankPT, a TimeML annotated corpus of Portuguese. It has been produced by adapting an existing resource for English, namely the data used in the first TempEval challenge. TimeBankPT is the first corpus of Portuguese with rich temporal annotations (i.e. it includes annotations not only of temporal expressions but also about events and temporal relations). In additi...
متن کاملCorpus linguistics meets language technology:
To the extent that NLP is used by QA systems, it is mostly limited to tokenization, named entity recognition, stemming, POS tagging, and shallow parsing. More sophisticated NLP such as (deep) syntactic parsing is hardly ever used. In the present paper I investigate why this should be the case and try to establish how deep syntactic parsing as developed in the field of corpus linguistics might c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: DELTA: Documentação de Estudos em Lingüística Teórica e Aplicada
سال: 1999
ISSN: 0102-4450
DOI: 10.1590/s0102-44501999000200003